Multi modal multi-semantic image retrieval
نویسنده
چکیده
.................................................................................................................... viii ACKNOLWEDGEMENTS .................................................................................................... x ABBREVIATIONS ................................................................................................................. xi CHAPTER 1 INTRODUCTION ......................................................................................... 1 1.1 Motivation ....................................................................................................................... 2 1.1.1 Representing Visual Content and Classifying Images ........................................... 3 1.1.2 Ambiguity of Natural Language in Text Captions ................................................ 4 1.1.3 Use of Hybrid Visual and Textual Metadata Models ............................................ 5 1.2 Research Objectives ........................................................................................................ 6 1.3 Structure of this Thesis .................................................................................................... 8 CHAPTER 2 FUNDAMENTALS ...................................................................................... 10 2.1 Main Processes for IMR ................................................................................................ 10 2.2 Content-Based Image Retrieval (CBIR) ........................................................................ 13 2.2.1 Global Features .................................................................................................... 14 2.2.2 Local Features ..................................................................................................... 15 2.3 Semantic-Based Image Retrieval (SBIR) ...................................................................... 16 2.3.1 Knowledge (Ontology-based) Representation Techniques ................................. 18 2.3.2 Advantages of Using Ontologies for IR .............................................................. 21 2.4 MPEG-7 and Ontology-based KB ................................................................................. 21 2.5 Summary ........................................................................................................................ 25 CHAPTER 3 SURVEY AND ANALYSIS OF THE STATE-OF-THE-ART FRAMEWORKS ......................................................................................... 26 3.1 Problem Analysis of the Image Retrieval Systems ........................................................ 26 3.2 Formal Requirements of Image Retrieval Systems ....................................................... 28 3.3 Survey of State of the Art Frameworks ......................................................................... 29 3.3.1 Ontology-Based KB Frameworks for IMR ......................................................... 29 3.3.2 Visual Features-Based Frameworks for Visual Content Representation ............. 34 3.4 Discussion ...................................................................................................................... 39 3.5 Summary ........................................................................................................................ 42
منابع مشابه
Public Transport Ontology for Passenger Information Retrieval
Passenger information aims at improving the user-friendliness of public transport systems while influencing passenger route choices to satisfy transit user’s travel requirements. The integration of transit information from multiple agencies is a major challenge in implementation of multi-modal passenger information systems. The problem of information sharing is further compounded by the multi-l...
متن کاملImage Retrieval: Content versus Context
In this paper, we introduce a new approach to image retrieval. This new approach takes the best from two worlds, combines image features (content) and words from collateral text (context) into one semantic space. Our approach uses Latent Semantic Indexing, a method that uses co-occurrence statistics to uncover hidden semantics. This paper shows how this method, that has proven successful in bot...
متن کاملExploiting Multimedia Content: a Machine Learning Based Approach
This thesis explores use of machine learning for multimedia content management involving single/multiple features, modalities and concepts. We introduce shape based feature for binary patterns and apply it for recognition and retrieval application in single and multiple feature based architecture. The multiple feature based recognition and retrieval frameworks are based on the theory of multipl...
متن کاملBayesian non-parametrics for multi-modal segmentation
Segmentation is a fundamental and core problem in computer vision research which has applications in many tasks, such as object recognition, content-based image retrieval, and semantic labelling. To partition the data into groups coherent in one or more characteristics such as semantic classes, is often a first step towards understanding the content of data. As information in the real world is ...
متن کاملMulti-Modal Retrieval for Multimedia Digital Libraries: Issues, Architecture, and Mechanisms
Supporting effective and efficient retrieval of multimedia data is a challenging problem in building a digital library. In this paper, we examine the issues related to accommodating multi-modal retrieval of multimedia data (text, image, video and audio), and propose 2M2Net as a generic framework for such versatile retrieval in multimedia digital libraries. The retrieval is conducted based on th...
متن کاملA Bag of Semantic Words Model for Medical Content-based Retrieval
The bag of visual words model has been widely used in contentbased image retrieval. However, when it is applied to medical domain, it potentially has several limitations, e.g., some ordinary feature descriptors may not be able to capture the subtle characteristics of medical images; there is a semantic gap between the low-level features and the medical concepts; the emerging multi-modal data po...
متن کامل